Speech
Corpus,
Language Type:
Multilingual
Languages:
Arabic Bengali Dari Egyptian Arabic English Georgian Hindi Iranian Persian Italian Japanese Khmer Korean Lao Mandarin Chinese Min Nan Chinese Moroccan Arabic Panjabi Persian Russian Spanish Tagalog Thai Tigrinya Urdu
Availability:
From Owner
License:
LDC
Size:
640 hours
Production Status:
Existing-used
Use:
Language Identification
- Paper title: Modeling and training strategies for language recognition systems
- Paper track: 4.1 Language identification and verification, lang/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2008 NIST Speaker Recognition Evaluation | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
Arabic Bengali Dari Egyptian Arabic English Georgian Hindi Iranian Persian Italian Japanese Khmer Korean Lao Mandarin Chinese Min Nan Chinese Moroccan Arabic Panjabi Persian Russian Spanish Tagalog Thai Tigrinya Urdu
Availability:
From Owner
License:
LDC
Size:
950 hours
Production Status:
Existing-updated
Use:
Language Identification
- Paper title: Modeling and training strategies for language recognition systems
- Paper track: 4.1 Language identification and verification, lang/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2008 NIST Speaker Recognition Evaluation Training Set Part 2 | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Emotion Recognition/Generation
- Paper title: Conversational and Social Laughter Synthesis with WaveNet
- Paper track: 3.6 Social signal processing/Poster Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Hiroki Mori | Online gaming voice chat corpus with emotional label (OGVC) | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
From Data Center(s)
License:
ATR
Size:
None
Production Status:
Existing-used
Use:
Machine Learning
- Paper title: End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
- Paper track: 8.8 Acoustic model adaptation (e.g. bandwidth, emo/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Emiru Tsunoo | ATR-503 | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Not Available
License:
Size:
130 sessions of interactions Other
Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
- Paper title: Turn-taking Prediction Based on Detection of Transition Relevance Place
- Paper track: 11.1 Spoken dialog systems/Poster Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kohei Hara | ERATO Human-Robot Interaction Corpus | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Not Available
License:
Size:
130 sessions of interactions Other
Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
- Paper title: Analysis of effect and timing of fillers in natural turn-taking
- Paper track: 11.12 Other topics in Spoken dialog systems and co/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Divesh Lala | ERATO Human-Robot Interaction Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Cantonese English French German Gishu Greek Gujarati Hebrew Hindi Indonesian Japanese Korean Mandarin Persian Portuguese Runyankore Russian Spanish Turkish Vietnamese
Availability:
Freely Available
License:
OpenSource
Size:
22.8 GByte
Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
- Paper title: Speaking rate, information density, and information rate in first-language and second-language speech
- Paper track: 1.10 Bilingual and L2 acquisition and processing/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ann Bradlow | The ALLSSTAR Corpus | /N |
Documentation:
Documentation in English is available to the public (via the project website)
Written
Corpus,
Language Type:
Multilingual
Languages:
Chinese English Japanese
Availability:
From Data Center(s)
License:
Size:
None
Production Status:
Existing-used
Use:
Text Mining
- Paper title: Stochastic Tokenization with a Language Model for Neural Text Classification
- Paper track: Long/Sentiment Analysis and Argument Mining
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tatsuya Hiraoka | NTCIR-6 Opinion | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Japanese
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-updated
Use:
- Paper title: Multi-Source Cross-Lingual Model Transfer: Learning What to Share
- Paper track: Long/Multilinguality
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Xilun Chen | Cross-Lingual Sentiment Dataset | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Not Available
License:
N/A
Size:
3680 entries
Production Status:
Newly created-finished
Use:
Machine Learning
- Paper title: Diverse and Non-redundant Answer Set Extraction on Community QA based on DPPs
- Paper track: Long paper/
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shogo Fujita | Yahoo! Chiebukuro diversified answers dataset | /N |
Documentation:
N/A




